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PREFACE 


This report document* the development of a universal four- 
channel water discriminant for the Landsat-3 mul tispectral 
scanner (MSS). The report is divided into two volumes. Part 1 
describes the approach# data# preprocessins# analysis# and re- 
sults. Part 2 contains technical appendices Iistin9 the input 
data# software# and computer-senerated output. 
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1, PURPOSE 


Mu) ti spectra I scanner's onboard NASA unmanned Landsat satellites 
provide valuable sources of current data for earth resources 
appl icat ions* A serious problem in utilizing Landsat data is 
the cost and difficulty of developing an individual set of sig- 
natures for each scene to be processed. This report describes 
the development of a 'universal' water signature for use with 
Landsat-3 mu) tispectral scanner (MSS) digital data. 


1.1 BACKGROUND 


The U. S. Army Corps of Engineers is resumed by Public Law 92- 
367 to inventory and inspect qualifying non-federal dams. The 
qua! i f icat ions are based on a combination of structure height 
and volume of water impounded. The Corps has been using surface 
water maps generated by the Detection And Mapping (DAM) package 
from Landsat-1 and Landsat-2 mul t ispectral scanner (MSS) digital 
data to help update the existing inventory of dams. The set of 
spectral signatures being used to classify water were previously 
developed from Landsat-1 data in Texas and Landsat-2 data in 
Alabama. They are not useable with Landsat-3 MSS data due to 
differences in sensor response and calibration# 


1.2 PERFORMANCE CRITERIA 


Manual procedures documented as part of the DAM package elim- 
inate certain urban features and terrain shadows incorrectly 
classified as water by the computer processing. The final 
classification results required by the Corps of this combined 
computer and manual processing are stated as percentages of the 
total number of water bodies ten acres or larger* 

A 90 percent or better correct detection of water bodies 10 
acres or larser 

A 10 percent or fewer 'false alarms' < groups of non-water 
pixels misclassified as water bodies) 


Although some proportion of impoundments less than 10 surface 
acres are expected to qualify for inclusion in the inventory, no 
criteria for their detection have been specified by the Corps. 
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2 , 


STUDY DESIGN 


2,1 CONSTRAINTS 


This study was limited by the availability of hardware, soft- 
ware, and human resources supporting or available to the Earth 
Observations Division at the NASA Johnson Space Center in Hous- 
ton, Site selection was limited to areas covered by retrospec- 
tive data only, The Earth Resources Interactive Processing Sys- 
tem (ERIPS) at JSC, the Bendix-100 digitizing system at JSC, and 
the On-Line Pattern Analysis and Recognition System (OLPARS) at 
Rome Air Development Center were not available for use in this 
study, Technical problems prevented the use of EOD LARSYS, No 
resources were available for field trips to any of the study 
si tes, 


2,2 APPROACH 


Most of the United States surface area is neither water nor com- 
posed of materials which are spectrally near to water in the 
Landsat-3 MSS channels, The approach taken in this study is to 
select diverse study sites which have significant numbers of 
water bodies between two and fifty acres in size, to select only 
portions of thtpse sites which have the greatest density of water 
bodies, and then to analyze intensively only those pixels which 
are spectrally near to water, 


Figure 1 illustrates this approach, Preprocessing of both the 
Landsat and air photo data is designed to select only ground 
features and pixels which are water or (spectrally) near-water, 
and to eliminate other features and pixels from subsequent pro- 
cessing, To further reduce the effect of pixels lying far from 
the water/non-water spectral boundary, a linear discriminant 
hyperplane is fitted iteratively, with correctly classified pix- 
els far from the hyperplane receiving less weight on subsequent 
i terat ions, 
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3* DATA AND PREPROCESSING 


3.1 SITE AND FRAME SELECTION 


Great car* wa* taken to select a set of study sites which varied 
in terrain# climate# ves&t ion# time of year# and sun elevation 
ansle. Nevertheless# all study sites were required to meet the 
following criteria! 

* Landsat and color infrared air Photo coverage within one 
week of each other 

A No significant precipitation immediately before or be- 
tween the Landsat and air photo coverage 

* No extensive clouds or haze in the parts of the Landsat 
scene covered by air photos 

* No scenes with sun elevation angles less than 35 degrees 

A Significant number of water bodies between two and fifty 
acres 

A Some (spectrally) near-water features# such as intensive 
urban# industrial# terrain shadows 

A Published maps available at, It 24 #000 or 1«62#50Q 


An approximate count of water bodies by size was made for every 
air photo frame in each site. Based on this count# the variety 
of shape and color# and the presence of likely 'near-water' 
features# individual frames (not necessarily contiguous) were 
selected within each site. 


Figure 2 lists the selected sites# together with information on 
the Landsat and air photo data. 


3,2 AIR PHOTO PREPROCESSING 


Each of the 28 selected air photos was individually rectified 
and enlarged to a scale of li24>000 using published 7.5 and 15 
minute guadransle maps for control. All water bodies over two 
acres were then delineated on a mylar sheet over each rectified 
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Figure 2, - Landsat Beenes and aerial photography. 
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frame* Wh#n necessary* the quadrangle mars were used and the 
roll film was interpreted usin* a stereoscope* The area of aach 
dal inaatad water body lass than 50 acres was than measured with 
a digitizer. 


Each rectified air photo was also overlaid with tha correspond- 
ins quadrangle maps* and tha UTM coordinates of aach photo cor- 
ner recorded for later use in generating matchins computer dis- 
plays and maps* 


3*3 LANDSAT PREPROCESSING 


A sat of potential control points was selected for each of the 
four Landsat scenes* Displays of Landsat data for these points 
ware visually correlated with the corresponding quadrangle maps 
and both Universal Transverse Mercator (UTM) coordinates and 
scanner coordinates measured for each point* The control net- 
work for each scene was then adjusted us ins the CONTROL program 
in the DAM package* Root -mean-square (RMS) errors for the net- 
work adjustments of these scenes ranged from 53 meters to 64 
meters* 


A quick test with Landsat-3 data soon demonstrated that the 
Landsat-2 series of water signatures used with the DAM package 
were not adequate to generate even the preliminary water-ness 
displays needed to screen out the obvious non-water pixels* 
Cluster means from four Landsat-3 scenes near the Sabine River 
in east Texas* previously generated and labelled for another 
project* were used to fit an approximate water-ness transforma- 
tion using MSS channels i (Band 4) and 4 (Band 7)* 


Approximate water-ness displays were now generated for each rec- 
tified air photo by the DAM package* using this transformat ion 
and the previously measured photo corner UTM coordinates* 
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4. ANALYSIS 


Although this chapter is divided into distinct sections covering 
the different tyres of analysis performed on the preprocessed 
data# much of this worK was actually performed in parallel* 
Thus# analysis of preliminary spectral plots or the performance 
of a preliminary discriminant fit for Just one photo frame# or 
for Just the frames in one site# typically helped to Point out 
label I ins errors# or even preprocessing errors. 


4*1 GROUND FEATURE SELECTION AND LABELLING 


Usins the approximate water-ness displays# rectified air photos# 
and the delineated overlays# approximately 1300 pixels# spread 
approximately equally between water bodies and non-water fea- 
tures spectrally near to water# were identified and labelled* 


The Labels and their meanings are shown in Figure 3. The label 
MX <fur mixture pixels) was not used original ly but was added to 
relabel those water pixels which# upon closer scrutiny# appeared 
to be edse pixels rather than solely water pixels. 


The water-ness displays senerated with the approximate Sabine 
transformation were used to label the Louisiana# Nash ins ton# and 
New YorK sites. The displays of the sparsely vesetated Montana 
site were not adequate far label I ins this arid site since they 
showed far too many water and 'near-water' pixels. The Montana 
displays were re-run usins a revised transformation derived from 
a linear discriminant fitted in a sinsle pass to labelled pixels 
from the other three sites. 


4.2 PAIRWISE SPECTRAL PLOTS 


Various pairwise spectral plots were produced# by grouping la- 
bels together into classes and then assigning symbols to the 
classes. These spectral plots were used to detect labelling 
errors and to help develop an intuitive feel for the data. 


Figure 4 shows two spectral plots# one of channels 1 and 4# the 
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CL ■ Cloud 
SH * Shadow 
BS ■ Bare Sol I 
SD * Sand Dunes 
ML « Wetlands 
PA ■ Pavement 
BR *» Bui (dins Roofs 
CO * Commercial 
IN * Industrial 
DT «= Downtown 
TP * Treatment Ponds 
RL * River Light Blue 
RM * River Medium Blue 
RD * River Dark Blue 
P,L B Pond Light Blue 
PM « Pond Medium Blue 
PD » Pond Dark Blue 
LL b Lake Light Blue 
LM = Lake Medium Blue 
LD = Lake Dark Blue 
MX B Mixture pixels 

Figure 3. - Pixel labels# 
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Figure 4. 


Spectral Plots of channel 4 with channels 1 and 
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other of channels 2 and 4* Theae plots are derived from all 
1300 points on all 28 frames. *H* symbolizes one or more non- 
water pixels# symbolizes one or more water pixels# and '2' 
symbolizes at least one non-water pixel AND one water pixel at 
the same location in the spectral plane. (The circles drawn 
around some spectral locations are explained in the next sec- 
tion.) Not surprisingly# channel 4# the farthest of the near 
infrared channels# is the best sinsle channel to separate water 
from non-water and channels 1 and 4# the least correlated of the 
Landsat MSS channels# are the best two channels. 


Figure 5 shows a spectral plot of channels 3 and 4. Notice that 
although these two channels are highly correlated# lying very 
nearly on a straight line# still the water and non-water pixels 
overlap very little on this plot. In spite of their high cor- 
relation# this pair of channels discriminates far better than 
the less correlated pair of channels 2 and 4 in Figure 4. 


4.3 WEIGHTED LINEAR DISCRIMINANT FIT 


All the pixels with water labels were grouped into class 1 and 
all the non-water pixels (except shadows and mixture) were 
grouped into class 2. The first iteration to fit a discriminant 
hyperplane between the two distributions was performed giving 
the same weight to all pixels. On subsequent iterations# pixels 
closer to the hyperplane (and therefore in danger of being mis- 
classified) received more weight than pixels farther away which 
were in no danger of being misc lassi f ied» 


Figure 6 shows the coefficients after the final iteration. On 
the left side of the figure is a general summary of how this 
discriminant classified all the pixels. On the right side is a 
detailed summary of how the discriminant classified those pixels 
close to the hyperplane (including all misclassi f ied pixels). 


A total of 9 out of 700 water pixels (or 1.3 percent) and 8 out 
of 574 non-water pixels (or 1.4 percent) are misclassif ied by 
this discriminant. Figure 7 lists all 17 misc lassi f ied pixels. 
Several are isolated pixels in the Mississippi River. It was 
not possible to determine whether any of these might have been 
associated with barges in the river. The bauxite mill and com- 
mercial building roofs are in industrial and commercial areas 
and probably could have been eliminated by an analyst without 
referring to any supplemental data sources other than the pub- 
lished quadrangle maps. The dark sand dune faces in eastern 
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STATE 

FRAME 

• 

FEAT 

• 

1 

CHANNEL 
2 3 

4 

CLASS 

LABEL 

DISCR 

FEATURE 

DESCRIPTION 

LA 

9734 

504 

21 

24 

21 

8 

1 

RL 

-1.4 

MISS, RIVER 

LA 

9734 

504 

19 

24 

23 

8 

1 

RL 

-1.4 

MISS, RIVER 

LA 

9691 

SOS 

22 

30 

30 

9 

1 

RM 

-1.0 

MISS. RIVER 

LA 

9691 

507 

19 

24 

23 

8 

1 

RM 

-1.4 

MISS, RIVER 

LA 

9691 

507 

21 

24 

21 

8 

1 

RM 

-1.4 

MISS, RIVER 

WA 

1440 

609 

21 

21 

19 

7 

2 

IN 

0. 7 

BAUXITE MILL 

WA 

1440 

502 

23 

23 

23 

7 

2 

BR 

2.6 

DK BLDG ROOF 

WA 

1440 

502 

21 

21 

20 

7 

2 

BR 

1.0 

DK BLDG ROOF 

WA 

1440 

502 

22 

21 

20 

7 

2 

BR 

1.3 

DK BLDG ROOF 

WA 

1440 

508 

21 

20 

20 

7 

2 

BT 

0.9 

DK BLDG ROOF 

WA 

1535 

514 

29 

31 

29 

9 

n 

SD 

1.0 

DK DUNE FACE 

WA 

1535 

519 

28 

27 

24 

8 

2 

SD 

1.8 

DK DUNE FACE 

NY 

076 

70 

21 

14 

18 

a 

1 

LD 

-2.6 

DK LAKE 

NY 

076 

20 

21 

16 

21 

8 

1 

LD 

-1,7 

DK LAKE 

NY 

076 

20 

21 

15 

18 

8 

1 

LD 

-2.6 

DK LAKE 

NY 

076 

20 

21 

17 

20 

8 

1 

LD 

-1.9 

DK LAKE 

MT 

9100 

540 

18 

18 

21 

7 

2 

BS 

0.2 

DK BARE SOIL 


CLASS 1 « WATER 


CLASS 2 » NON-WATER 


Figure 7. - Pixels misc I assi f ied by linear discriminant* 
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Washington are in the vicinity of Pothole* Lake# They are spec- 
trally identical in ell four MSS channels to some interior (non- 
ed»e) water pixels* The pixels in the dark lake (feature number 
20) in upstate New York are spectrally different from typical 
water pixels* 


The misclassi f ied pixels listed in Figure 7 are circled on the 
spectral plots in Figures 4 and 5. Many circles occur over 2's* 
where the same spectral values in these channels are shared by 
both a water pixel and a non-water pixel* Note that any poss- 
ible linear two-channel spectral limits would have more classi- 
fication errors than the four-channel linear discriminant hyper- 
p| ane* 


4.4 PER FEATURE AND SHIFTING TABULATION 


Based on the discriminant coefficients reported in section 4.3# 
a new series of water-ness displays was generated in which each 
increment in symbols displayed indicated a shift* or transla- 
tion* of the hyperplane corresponding to a change of 0.2 in the 
bias term of the discriminant. The water bodies on these dis- 
plays were tabulated by number of pixels for ten different 
shifted positions of the hyperrlane* Figure 8 shows that for 
water bodies of ten acres or greater the unshifted discriminant 
is best* with a per feature false alarm rate of 8.7 percent and 
a miss rate of 8.7 percent* based on the occurrence of one or 
more pixels. The unshifted discriminant (with a bias term of 
7.128) exceeds the performance criteria and is recommended. 

The miss rate of 8.7 percent is an average for all 414 water 
bodies 10 acres or larger in surface area. The discriminant had 
a higher miss rate for the smaller water bodies in this group 
and a lower miss rate for the larger water bodies in this group* 
as may be seen from Figure 9. 

The number of pixels detected per water body also varied with 
surface area* as shown in Figure 10. However* differences in 
shape and in pixel alignment with respect to the water body in- 
troduce considerable random variation* although the mean number 
of pixels in a 10 acre water body is approximately three* the 
standard deviation is nearly 2.5 pixels. These data suggest the 
difficulty of attempting to estimate the area of small water 
bodies from Landsat MSS pixel counts. 

By using the complete tabulations in Appendix-G* it is possible 
to estimate the performance of any of the shifted discriminants 
for water bodies of other size ranges. Likewise* performance 


SYMBOLS 

BIAS 

TERM 

FALSE 

ALARMS 

NUMBER PERCENT 

MISSES 

NUMBER PERCENT 

0 

to 

0 

6.728 

28 

6. ax 

36 

8,7% 

0 

to 

1 

6.928 

35 

8. 4X 

36 

8.7% 

0 

to 

2 

7.128# 

36 

8. 7% 

36 

8,7% 

0 

to 

3 

7.328 

38 

9.2% 

36 

8,7% 

0 

to 

4 

7.528 

45 

10,9% 

35 

8,4% 

0 

to 

5 

7*728 

48 

11.6% 

34 

8,2% 

0 

to 

6 

7.928 

52 

12,6% 

32 

7,7% 

0 

to 

7 

8.128 

62 

15.0% 

30 

7.2% 

0 

to 

a 

8.328 

74 

17,9% 

29 

7,0% 

0 

to 

9 

8.528 

91 

22.0% 

27 

6,5% 


# Bias term from weighted linear discriminant fit 
(414 water bodies 10 acres or larger) 


Figure 8, - Effect of bias term on 10 acre water bodies 
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WATER BODIES 


SIZE 

MISSED 

TOTAL 

(acres) 

« 

X 

« 

2 - 

2.9 

299 

92 

326 

3 — 

3.9 

145 

77 

186 

4 - 

4.9 

98 

69 

143 

5 - 

5.9 

60 

55 

109 

6 - 

6.9 

26 

41 

64 

7 - 

7.9 

28 

49 

57 

a - 

8.9 

11 

21 

52 

9 - 

9.9 

9 

24 

38 

10 - 

10,9 

10 

22 

46 

11 - 

11.9 

5 

20 

25 

12 - 

12,9 

5 

18 

28 

13 - 

13.9 

1 

7 

14 

14 - 

14,9 

2 

9 

23 

15 - 

15.9 

1 

7 

15 

16 - 

16,9 

2 

15 

13 

17 - 

17,9 

3 

19 

16 

18 - 

18.9 

0 

0 

12 

19 - 

19.9 

1 

9 

11 

20 - 

20.9 

1 

13 

8 

21 - 

999 

5 

2 

203 


WATER BODIES 


SIZE 

MISSED 

TOTAL 

(acres) 

• 

X 

• 

2 - 

999 

7 12 

51 

1391 

3 - 

999 

413 

39 

1065 

4 - 

999 

26 8 

31 

877 

5 - 

999 

170 

23 

734 

6 - 

999 

110 

18 

625 

7 - 

999 

84 

15 

561 

8 - 

999 

56 

11 

504 

9 - 

999 

45 

10 

452 

10 - 

999 

36 

9 

414 

11 - 

999 

26 

7 

368 

12 - 

999 

21 

6 

343 

13 - 

99? 

16 

5 

315 

14 - 

999 

15 

5 

301 

15 - 

999 

13 

5 

278 

16 - 

999 

12 

5 

263 

17 - 

999 

10 

4 

250 

18 - 

999 

7 

3 

234 

19 - 

999 

7 

3 

222 

20 - 

999 

6 

3 

211 

21 - 

999 

5 

2 

203 


Figure 9. - Discriminant performance* by size of water body 
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SIZE OF 

NUMBER OF 

PIXELS PER 

WATER BODY 

AVERAGE 

WATER 




ACRES 

BODY 

STANDARD 


MOVING 

PER 

< acres) 

DEVIATION 

MEAN 

AVERAGE 

PIXEL 

2 - 2.9 

0.38 

0.10 

pm CJf 

mm mm 

3 - 3.9 

0.95 

0,38 

0,32 

10,9 

4 - 4.9 

0.81 

0.48 

0.54 

8.3 

5 - 5.9 

1.02 

0.77 

0,93 

5,9 

6 - 6.9 

1.71 

1.55 

1.23 

5.3 

7 - 7.9 

1.86 

1,37 

1,68 

4, 5 

a - 8.9 

1.89 

2.13 

2,07 

4,1 

9 - 9.9 

2.40 

2.71 

2,72 

3. 5 

10 - 10.9 

2,47 

3.30 

3,11 

3.4 

11 - 11.9 

3.25 

3,32 

3,20 

3,6 

12 - 12.9 

2,38 

2.96 

3.21 

3.9 

13 - 13.9 

2.06 

3.36 

3,77 

3,6 

14 - 14.9 

3.59 

5.00 

4.79 

3.0 

15 - 15.9 

3.64 

6,00 

5,26 

2,9 

16 - 16.9 

3.65 

4.77 

5.61 

2.9 

17 - 17.9 

4.31 

6.06 

6,11 

2,9 

18 - 18.9 

3.70 

7.50 

6.94 

2.7 

19 - 19.9 

3.41 

7.27 

6,97 

2,9 

20 - 20.9 

3.31 

6.12 

7.16 

2.9 

21 - 21.9 

4.11 

8.09 

7,81 

2,8 

22 - 22.9 

2.94 

9.20 

9.35 

2,4 

23 - 23.9 

5.01 

10.75 

- - 

mm. pm 


Figure 10. - Number of pixels detected! by size of water bod> 
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can be estimated if* to further minimize tha already acceptable 
falsa alarm rate* we wish to require wore than ona contiguous 
rixel for a watar body to qualify* Sine# tha statad performance 
critaria ara mat by tha original fitted discriminant on a single 
rixel basis* such devices have been not been needed in the pre- 
sent study* 
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5# CONCLUSIONS 


Th# techniques used in screening th# data and in iteratively 
fit tin* th* discriminant to rix*l values weighted by th*ir 
closeness to th* hyperrlan* *»rovid*d a signature which appears 
to be optimum at least with respect to translation of th* hyper- 
plane* Tim* and resources did not permit th* evaluation of th# 
•ffect on discriminant performance of different rotations of the 
hyperrl an** 


Intermediate discriminants derived from data for only two sites 
did far less well in classi fyins the remaining sites* suggesting 
that a sufficiently large and diverse sample is very important 
in attempting to develop truly 'universal ' signatures. 
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